per 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 

International Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification: 
Not classified 



A2 



(11) International Publication Number: 
(43) International Publication Date: 



WO 92/03905 
19 March 1992 (19.03.92) 



(21) Internationa] Application Number: 

(22) International Filing Date : 2Se 



PCT/SE91/00585 
• 1991 (02.09.91) 



(30) Priority data: 

576^216 



31 August 1990 (31.08.90) US 



(71) Applicant: AB VOLVO [SE/SE]; S-405 08 Gdteborg (SE). 

(72) Inventor: WEDELIN, Dag ; IskMlareliden 9 A, S-416 55 

Gdteborg (SE). 

(74) Agent: BERGQUIST, Gunnar; AB Volvo, Teknisk Utveck- 
ling, Patent Department 06820, HCBVN, S-405 08 Gdte- 
borg (SE). 



(81) Designated States: AT (European patent), BE (European 
patent), CH European patent), DE (European patent), 
DK (Europe. . s patent), ES (European patent), FR (Eu- 
ropean patent), GB (European patent), GR (European 
patent), IT (European patent), JP, LU (European pa- 
tent), NL (European patent), SE (European patent). 



Published 

With declaration under Article I7(2Xa). 
Without classification and without abstract: tide not 
checked by the International Searching Authority. 



(54) Title: A METHOD AND APPARATUS FOR OPTIMALLY ALLOCATING RESOURCES 



Best Available Copy 



FOR THE PURPOSES OP INFORMATION ONLY 



Codes used to identify States party to the PCT oo the front pages of pamphlets publishing international 
applications under the PCT. 



AT 


Austria 


es 


Spain 


MG 




AU 


Australia 


Ft 


Finland 


ML 


Mafi 1 *"** 1 


BB 


Barbatlos 


PR 


France 


MN 


Mongolia 


BE 


Belgium 


CA 


Gabon 


MR 


Mauritania 


BP 


Burkina Faao 


Cfi 




MW 


Malawi 


BG 


Bulgaria 


CN 


Guinea 


NL 




BJ 


Benin 


CR 


Greece 


NO 


Norway 


BR 


Brazil 


BU 


Hungary 


PL 


rotano 


CA 


Canada 


rr 


Italy 


RO 




CP 


Central African Republic 


jp 


Japan 


SO 


Sudan 


OG 


Congo 


KF 


Democratic PcopJe*5 Republic 


SB 




GH 


Jn ■ ■! 




of Korea 


SN 




a 


Cdtedlxoitc — 


KR 


Republic of Korea 


SO* 


Soviet Unk 


CM 


Cameroon 


U 




TO 


Chad 


cs 


Czechoslovakia 


U£ 


Sri Lanka 


TC 


Togo 




Germany 


LU 




US 


United Stai 


DK 




MC 









+ Any designation of "SU" has effect in the Russian Federation. It is not yet known whether 
any such designation has effect in other States of the former Soviet Union* 



WO 92/03905 



PCT/SE91/00585 



1 



A METHOD AND APPARATUS FOR OPTIMALLY 
ALLOCATING RESOURCES 



10 



15 



20 



25 



30 



BACKGROUND OF THE INVENTION 
Combinatorial optimization includes well known 
modelling problems such as the travelling salesman 
problem, real world problems such as scheduling problems 
and various pattern recognition problems (sometimes 
called constraint satisfaction problems) . In 
particular, many common industrial resource allocation 
problems can be defined as combinatorial optimization 
problems. The task is usually to minimize cost given a 
number of constraints that describe the restrictions of 
the particular situation. In some cases the variables 
whose values are to be chosen are continuous (such as 
distance, weight, etc.), and then techniques such as 
linear programming are used. Often however, the 
entities to be determined are discrete, due to the fact 
that people and machines are indivisible, and cannot be 
assigned in fractions. Because of the combinatorial 

# 

explosion, such problems are extremely difficult to 
solve optimally within a reasonable amount of time and 
they can often be proved to be NP-hard. For small 
problems exhaustive search may be possible, but for 
larger problems this is not possible. Certain problems 
may belong to a special case or allow for decomposition 
into smaller problems. 

Examples of established methods are local 
optimization, selective combinatorial search, heuristics 
and integer linear programming. Hybrid methods are also 
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sometimes used. Some methods, such as integer linear 
programming, emphasize the search for optimal solutions. 
Other methods, such as heuristics and probabilistic 
relaxation, try to find good but possibly suboptimal 
solutions. This is the most suitable approach for very 
large problems, since exact methods often fail to find a 
solution at all, while in practice it is often 
satisfactory to find suboptimal solutions. 

Integer programming methods, which seem to be the 
most favored current approach, differ from intelligent 
search methods in the sense that they relax the integer 
constraints and instead work, with real valued, 
continuous variable. By doing so, linear programming 
techniques may be used to give continuous approximations 
to the combinatorial problem, from which integer 
solutions may sometimes be deduced. Examples of linear 
programming approaches can be found in U.S. Patent Nos. 
4,744,026, 4,744,027, 4,744,028, and 4,914,563. 

Another possibility is to vise the approach of 
neural or connectionist networks. There exists a wide 
variety of neural net architectures , although they all 
share some basic characteristics . The computation is 
distributed over a large number of nodes in a network, 
with each node working independently of the other nodes, 
only exchanging information with its neighbors in the 
network. The topology of the network may or may not 
reflect the specific problem to be solved. In some 
cases, the variables cure allowed to vary continuously. 
In other networks, such as Boltzmann machines, the 
variables are discrete, and the network finds a solution 
by stochastically moving between different states guided 
by a method called simulated annealing. 

The present invention is designed to provide a 
faster, more efficient approach to solving combinatorial 
optimization problems. 
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SUMMARY OF THE INVENTION 
The probabilistic network to be described is based 
on a technique called probabilistic relaxation. The 
term "probabilistic" is motivated by the use of numbers 
between 0 and 1 which are used and ■ interpreted more or 
less as probabilities. The assignments are found by an 
iterative procedure beginning with an initial guess 
where each node is assigned a number between 0 and 1 
indicating an initial preference as well as an 
uncertainty. The iteration proceeds by adjusting the 
node values with respect to the constraints until the 
numbers converge to 0 or 1 giving the final assignment. 
In the iteration step, the constraints are checked many 
times over, in sequence or in parallel. Each check 
changes the values for the variables affected by the 
current constraint and makes them consistent with this 
constraint. This is done disregarding other constraints 
that are therefore "relaxed". The local and parallel 
characteristics of the iteration makes it possible to do 
the computations in a network, emphasizing the structure 
of the computation. 

To solve a combinatorial problem with this 
approach, the appropriate variables and constraints must 
be specified, together with an objective function to be 
minimized or maximized. The network for the problem is 
uniquely defined by these variables and constraints. 
Each node in the network corresponds to a variable and 
each arc corresponds to a constraint, so the topology of 
the network directly reflects the structure of the 
problem. The network is iterated until it reaches a 
stable state, giving the proposed solution. If the 
network does not contain cycles, it will be able to 
solve the problem optimally in a time proportional to 
the size of the network. In almost all cases, however, 
the network will contain cycles and will in general find 
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suboptimal solutions. Special techniques have to be 
applied to ensure that the network reaches a stable 
state and does not continue to oscillate forever* 

The actual computation that the network according 
to the present invention performs is to compute (or 
approximate) the marginal distributions of a probability 
distribution over n discrete finite variables 

Jt-i 

where oc denotes proportionality. Each potential g*(XJ 
is determined by a constraint and X k is the subset of 
variables active in the constraint. This distribution 
is constructed so that the solution of the problem can 
easily be determined from the marginal distributions. 

An important feature of probabilistic relaxation is 
its ability to find solutions which agree well with the 
constraints without explicit search of the space of 
possible -solutions. Contrary to previous probabilistic 
relaxation approaches, this network is not sensitive to 
initial assignments and the order of iteration. Since 
the network is not restricted to finding an optimal 
solution, it is able to give good solutions to very 
large combinatorial optimization problems (with several 
million variables) , that cannot be solved by existing 
methods. The network is fast also when it simulated on 
a standard computer. It also allows direct use of 
constraints that include more than two variables, in 
contrast to existing neural network methods. 

In a preferred embodiment of the present invention, 
a method of optimally assigning available resources and 
available facilities is provided comprising the steps of 
creating a network of nodes and arcs, the arcs 
connecting various ones of the nodes, where the nodes 
correspond to assignments of a resource and a facility 
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and the arcs correspond to constraints on the 
assignments, iteratively performing the following steps 
until a stopping criteria is reached representing that 
the network is in a stable state: 

calculating inputs to the arcs r^fxj 
disregarding the old outputs from the arcs s a (x i ) ; 

calculating the influences on the nodes s^fxj 
based on . the inputs to the arcs r a (x;.; ; and 

calculating the new values for the nodes p(xj 
based on the influences on the nodes s Sk (x i ) , and 
allocating the resources and facilities in accordance 
with the stable state reached by the network. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention of the present application will now 
be described in more detail with reference to the 
preferred embodiments of the apparatus, given only by 
way of example, and with reference to the accompanying 
drawings, in which: 

Figure 1 is a schematic illustration of the basic 
components in the network; 

Figure 2 illustrates the fixation method for 
convergence ; 

Figure 3 illustrates a crew assignment problem 
according to the present invention; 

Figures 4A and 4B are a flowcharts illustrating a 
module for establishing a set of pairings in accordance 
with the present invention; 

Figure 5 Is a flowchart illustrating a module for 
resource allocation in accordance with the present 
invention; 

Figure 6 is a block diagram illustrating an 
apparatus according to the present invention; and 

Figure 7 is a flowchart of the method according to 
the present invention. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The theory on which the present invention is based 
is published in Wedelin, "Probabilistic Networks and 
Combinatorial Optimization", Report 49 , Programming 
5 Methodology Group, University Of Goteborg and Chalmers 

University of Technology , May 1989 , publicly available 
September, 1989. This paper is incorporated by 
reference herein. 

To be able to discuss the network in detail, first, 

10 a description of the kind of problem the network 

attempts to solve is in order. Further, the formulation 
of how optimization problems in this context will be 
discussed. Finally, the network according to the 
present invention will be described. Consider a special 

15 class of probability distributions over n discrete 

finite variables X = (x„ — ,xj . The variables x t will 
in the most important case be binary so that xfi{0,l} but 
may also take an arbitrary finite number of discrete 
values. The probability distributions will be of the 

20 form 

a 

P(X)« c Y[q k (X k ) 

Jc-i 

where each q t (X k ) is a nonnegative function of a subset X k 
of the variables x lr — ,x a . The functions g^XJwill be 
called the potentials of p(X) . The normalization 
. constant is . chosen so that p(X) satisfies 

5>u>« i 

X 

25 where a simplified notation is used to indicate that the 

summation should be performed over all possible values 
of configurations of X, in the binary case 2" 
configurations. The constant C is never calculated so 
the following equation can be used: 
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m 

pU)« J[q k (X k ) 



(2) 



where <x denotes proportionality. 

To describe the structure of p(X) it is useful to 
associate with p(X) an undirected hypergraph with n 
nodes and m arcs. Each node corresponds to one of the 
variables x- t and each arc corresponds to a potential q k . 
In a hypergraph an arc is allowed to connect to more 
than two nodes. Most other basic definitions generalize 
intuitively from ordinary graphs. 

In the present case, the arc corresponding to q k 
should be connected to all of the nodes in X k . Arc q k is 
then connected to node x { if and only if q k depends on 
the variable x,. For simplicity, an arc will also be 
allowed to connect to only one node if q k for some k 
depends on only one variable. 

A relevant property of this hypergraph is whether 
it contains any cycles or if it is acyclic. A cycle is 
a path over the arcs from a node back to itself through 
a sequence of other nodes. If the hypergraph contains 
no cycles it is a hypertree. 

The probability distribution p(X) is said to have a 
Markov property with respect to the hypergraph since 

4 

p(x i |-XVc i ) =p(x 1 |tf i ) 

where N t denotes the set of neighbors to x t . This means 
that the conditional probability for x t given all other 
nodes in the system depends only on those nodes that are 
neighbors to x t in the graph. 

These conditional probabilities are local 
properties of p(X) and are calculated from the 
potentials. In fact, p(X) may be defined by specifying 
the functions p (x t \ N t ) . 
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On the other hand, it is general extremely 
difficult to calculate global properties of p(X) such as 
the marginal probabilities over a subset of the 
variables. It is also difficult to generate random 
samples from p(X) . In particular, the marginal 
probabilities 

are difficult to calculate except in special cases. The 
summation should be performed over all configurations of 
the variables in X except x,. In principle, the 
calculation is trivial but it suffers from the 
combinatorial explosion when n is large. 

The network to be presented attempts to approximate 
these marginals. The network has the same topology as 
the graph for p(X) and gives exact answers in the 
special case when the graph is acyclic. For 
unrestricted graphs the approximate results that can be 
obtained turn out to be useful in practice. 

TRANSFORMATION OF THE PROBLEM 

To solve a combinatorial optimization problem with 
probabilistic relaxation, a function p' (X) is defined of 
the form (2) so that maximization of p' (X) , 

max p*(X) 

becomes equivalent to solving the optimization problem. 
Combinatorial optimization problems are usually stated 
in terms of an objective function to be maximized or 
minimized subject to a number of constraints. By 
choosing the potentials appropriately, a wide range of 
combinatorial optimization problems can be represented. 
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For example, if the problem involves maximization 
of a linear function c(X)=T,p?c i then each term in the sum 
contributes to a potential 

qixj = e c * x * 

in p' (X) . A constraint over a set of variables X k is 
conveniently represented by 

rr(v \ - I 1 if constraint satisfied 
m A k> ~ \ o otherwise 

which makes p' (X)=Q when the constraint is not 
satisfied. 

Now define * 

_i 

P(X) «p'(*) T 

where T is a positive "temperature" constant. The 
notion of temperature has been inspired by the ideas of 
simulated annealing (see Gemean et al, "Stochastic 
Relaxation, Gibbs distributions, and the Bayesian 
restoration of images", IEEE Trans. Pattern Anal. 
Machine Intell. 6, pp. 721-741 (1984)) although an 
annealing schedule is not used herein. In a preferred 
embodiment of the present invention, T is set to a value 
in the range of 10E-20 to 10E-1, that is, 
0.000000000000000000001 to 0.1). T is kept at a 
constant value throughout the iteration process. A 
change in temperature from 1 to T modifies the 
potentials from q r k (X k ) to 

q k (X k ) = q' k {X k )^ 

The function p(X) can be maximized instead of p' (X) 
without changing the problem. Choosing T<1 "sharpens" 
p(X) so that differences in magnitude become exaggerated 
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compared to those of p' (X) . If T is chosen to be 
sufficiently near 0 then the maximum value of p(X) will 
be arbitrarily near 1 and the sum of p(X) over all other 
configurations will be negligible since Ljp(X)=l. If the 
maximum of p(X) should not be unique it can usually be 
handled by a slight perturbation of the problem. This 
perturbation would be created by the addition of a 
random noise to the costs in the problem. The amplitude 
of the noise is about 1/10000 of the costs and does not 
affect the solution* 

Calculation of the marginal probabilities of p(xj 
will now solve the optimization problem. Since the 
maximum value of p(X) dominates over all other values , 
the optimal configuration can be read directly from the 
p(X;) by solving the trivial maximization problems 

max p(x i ) 

for each variable. In practice p(x s ) will be near 1 for 
the right answer and near 0 otherwise. 

* 

If the p(Xt) cure calculated directly from their 
definition in equation (3) the procedure is no better 
than the simple approach of testing the value of p(X) 
for all possible configurations. However, if the 
marginal probabilities could be calculated or 
approximated efficiently the situation would be quite 
different. 

THE PROBABILISTIC NETWORK 

The basic idea of the network is illustrated in 
Figure 1. In the network, the p(xj are entities in the 
nodes while the s ik (x i ) and r^fx^ constitute a kind of 
flow through the arcs. Each arc receives input r a (Xi) 
from all its nodes and produces output s^fxj to these 
nodes in accordance with the equation 
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s i*< x J ~ £ ft<jy n (4) 

If qtfXi) depends only on the single variable x ir that is 
qtCXjJ^q^xJ , this equation simplifies to 

S iJt (x i ) « ^Ui) (5) 

Each node calculates its value pfx,; from its inputs by 
the equation 

P(*i> ~ II *i*<*i> (6) 

k.X £ €X k 

and its output to the arcs by 

« 



Each node collects its influences s^fxj and keeps 
p(xj and r^xj consistent with equations (6) and (7) . 
The network can therefore be in a stable state only when 
it satisfies these equations* A change in the output 
SxfxJ from an arc to a node results in a change of the 
current value of p(xj for that node and also in the 
inputs r^xj to the other arcs. This may in turn cause 
changes for other nodes. All calculations are local in 
the sense that they involve only one arc and its nodes. 

In principle, the updating of the variables in the 
network could be performed continuously in time. In 
discrete time, one iteration of the network is defined 
as one update of all the influences in the network. It 
is also convenient to define a special initialization 
step that updates only influences of the simple form of 
equation (5) . This step involves almost no calculation 
since the values can be read directly from the 
potentials . 
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When the network is acyclic these formulas uniquely 
define the p(xj as the marginals of equation (2). 

THE ARC ITERATION 
5 An arc iteration updates the influences from one 

particular arc to all its nodes and may to a large 
extent be parallelized. The arcs can be updated in 
parallel with the exception that adjacent arcs should 
not be iterated at exactly the same time. 

10 The reason for defining the arc iteration is that 

the influences can be more efficiently calculated when 
they are calculated together in this way. In the method 
according to the present invention , the <x sign should be 
interpreted as an assignment operator- This assignment 

15 performs in dependence on the functions p(xj , r^fxj , and 

s^Xf) . Before the iteration begins, all the variables 
should be initialized to the uniform distribution (oc i) 
and, if possible, according to equation (5). 
For all k (in parallel) : 

20 l. For all i, x t eX k (in parallel) : 



2. For all i, x f eX k (in parallel): 

3. For all i, x t ex k (in parallel): 

pOq) « r iJt (x 1 )s iJr (jc 1 ) 

The above method according to the present invention 
accomplishes the following steps. Step 1 calculates the 
25 input Jr a (x t ) to the arc by dividing away the old output 

from the arc, that is, the old influences on the node, 
from the current value of p(Xf) . Step 2 does the actual 
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updating of the influences and step 3 calculates the new 
values of p(x $ ) . 

For a hyperarc, the most critical part of the 
calculation is the sum over all the configurations of 
X t \x r In general, this calculation* will be exponential 
in the number of variables in X k . For many simple 
functions g*(XJ , however , the time needed for 
calculation of the influences from an arc in step 2 can 
be made linear in the number of variables. One common 
case is when q k (XJ contains only relatively few non-zero 
values such as for example, a function that is 1 if and 
only if exactly one of the argument variables is 1 and 0 
otherwise. Other possibilities are that the function 
shows some kind of symmetry so that the sums may be 
calculated without explicit summation or that it can be 
written as the sum of other functions that are simple. 
As a last resort, one may even attempt to approximate 
the sums. 

FORCING THE NETWORK TO CONVERGE AND RESOLVE 

Whether or not the network converges depends 
entirely on the kind of problem to be solved. For a 
problem giving an acyclic network, convergence is 
guaranteed after a finite number of steps. For some 
other problems, such as the assignment problem, 
convergence is also reached in a finite number of steps. 
For other more difficult problems, there will be no 
convergence, or only partial convergence in the sense 
that a subset of the network becomes stable. In 
particular, when a subset of the network yields stable 
results in several successive iterations, the subset is 
said to have converged for purposes of the present 
invention. The number of iterations needed for 
convergence with the herein described methods depends on 
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the particular problem instance but typically lies in 
the range of fifty to several hundred iterations. 

The basic idea considered so far for making the 
network converge is to adjust the potentials for single 
5 variables , q(xj , as little as possible so as to give a 

simpler problem that does converge. This is equivalent 
to dynamic manipulation of the cost for each variable 
since 

- 

q{ Xi ) = e CjX ' 

■ 

The adjustment can be done in two basic ways, by fixing 

10 nodes to determined values (setting a very high or low 

cost) or by continuous adjustment. 

The first method decides a specific cost of some 
node based on preliminary information that can be 
obtained from the unresolved or oscillating network. 

15 For example, if a binary variable x i is fixed to 1 then 

the potential q(xj is set to q(Xj=l)=1.0 and q(Xj=0)=0.0. 
After this fixation of the node, the modified network is 
iterated in order to adjust for the change. This 
process is then repeated until all nodes are stable and 

20 resolved. The stabilizing effect of fixation is due to 

the fact that it eliminates the interaction through the 
fixed node since r^xj = q(xj for all k. This 
effectively splits the network at the fixed node into 
several separate nodes with the same value, see Figure 

25 2. The number of cycles is reduced and eventually after 

repeated fixation, the network will contain no cycles 
and convergence is guaranteed. This method also 
provides a means for resolving a converged but 
unresolved network and thus makes the introduction of 

30 random costs unnecessary. 

The other basic method to make the network converge 
is to change the potentials q(x-j continuously in small 
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steps after each iteration so that the belief in the 
current values of b(xj is strengthened , where b(Xj) is 
the result of the unrestricted network. This eliminates 
the possibility of arriving at an entirely false 
solution and is more in the spirit of the network 
approach. In a preferred method, this is accomplished 
by modifying equation (7) in step 1 of the method to 

r ik oc 



where ce<l. 

As noted above, the present invention is applicable 
10 to a number of real world problems in which it is 

necessary to match resources and tasks within a system, 
subject to constraints. One such application is in the 
field of airline resource management. Other 
applications of the present invention include the fields 
15 of pattern recognition, VLSI design, expert systems, and 

production planning. Although the present invention may 
be applied in these, and other, fields, the following 
discussion will focus on the airline resource management 
application. 

20 Once a flight table has been constructed, an 

airline has to make sure that each flight will be 
provided with a crew. The schedules must comply with 
the general safety regulations, the company's own 
internal operating policies and the union requirements, 

25 while minimizing total cost. These requirements for a 

valid schedule constitute the constraints in the 
network- The present invention can be used to determine 
crew pairing construction and crew assignments. 
Further, the invention can be used to assess future crew 

30 requirements by allocating crew rotations. 

One step of the present invention in solving the 
airline resource management problem is to assign 
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activities to the personel. This problem can be 
formulated by a matrix where persons are lined up 
against activities as shown in Table 1. 

TABLE 1 JOB ROTATION 







Rl 


R2 


R3 


R4 


Mr. 


Brown 


6 


5 


8 


7 


Mr. 


Green 


3 


8 


7 


10 


Mr. 


White 


4 


•* 

1 


5 


9 


Mr. 


Black 


5 


6 


4 


6 



10 

Each entry in the matrix corresponds to a possible 
assignment. Each assignment is given a gain g(i,j) and 
only one activity can be assigned to each person and 
15 only one person on each activity. By introducing 

variables x(l,j) , where x(±,j) = l if person i is 
assigned to activity j, and otherwise, 0, a formal 
statement of the problem can be expressed as follows: 



Subject to: 



£ x(i, j) = l for all i 

J 



xU'J) = 1 for all j 

i 



= lor0 



TABLE 2 JOB ROTATION SOLUTION 



Mr. Brown 



Mr. White 
Mr. Black 



30 



Rl 


R2 


R3 


R4 


6 


5 


8 


7 


3 


8 


7 


10 


4 


1 


5 


9 


5 


6 


4 


6 
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In the solution shown in Table 2, Mr. Brown is 
assigned to rotation 3, Mr. Green to rotation 2, Mr. 
White to rotation 4, and Mr. Black to rotation 1. 

x(l,3) - 1, x(2,2) = 2, X(3,4) = 2, X(4,l) = 2, all 
other x are zero. The total gain is 

g(l,3)+g(2,2)+g(3, 4)+g(4,l) = 8+8+9+5=30 
and the mathematical theory for linear programming 
indicates that this is optimal. 

A natural generalization of this problem is to 
allow more than one person on each activity. Thus, the 
"capacity" of the activity is increased. Additionally, 
it may be preferrable to limit the capacities on groups 
of activities. Activities may occur in many variants, 
out of which only a few have to be used. 



It might be an objective to "fill" all capacity 
constraints to their limits. As far as possible, all 
rotations are to be supplied with a crew. Therefore, a 
"slack variable" is introduced representing the 
remaining capacity of the constraint. If this variable 
is given a large cost in the objective, the method 
according to the present invention will work hard to 
assign persons until the slack is zero. 



C is a negative term in the objective and y is an extra 
variable. In order to obtain a solution with a high 
gain, y must be small. This guarantees that a lot of 
assignments are made. The values of the numbers C are 
entered in a way that reflects the desire to "pack" to 
schedule. A large negative C means dense packing. 




for all and all 
x(i,j) * capacity activity groups 

S. 



x{i,j) + C • y = capacity 



for all 



i.jes 

y > 0 



activity 
groups S. 
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In application of the method to real life problems, 
it may not be satisfactory only to assign more than one 
person to each activity. It may also be desirable to 
assign more than one activity to each person* However/ 
activities assigned to the same person must fulfill a 
number of requirements, that is, satisfy a number of 
constraints. The most fundamental of these requirements 
is that no activities assigned to the same person may 
occur at the same time. This provides a set of 
"collision constraints" : 



Each subset Sk contains activities colliding at some 
specific moment. Around seven such critical moments are 
usually sufficient to cover all collision possibilities 
in a 14 day schedule. 

Each activity has a number of "ingredients" whose 
distribution among the employees must be controlled. , 
For example, it may be desirable to ensure variation in 
each individual schedule by giving upper bounds on the 
number of stops at certain airports. Another 
restriction of the same type would be to require a 
minimum amount of flight time with a certain aircraft 
type. Of course, the total workload must also be under 
control. 

In order to take care of these requirements, 
general linear constraints are introduced with soft 
bounds in the system. These will have the form: 



The numbers a(j) correspond to the halt of the 
ingredient in activity j and Y is a variable allowing 
the bounds to be exceeded at a cost C. Y is called an 



£ xii.j) s 1 



for all persons 
i and all subsets s. 
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artificial variable. A large negative number C will 
push the solution into the inside of the bounds. 

Another factor to be considered is the requirements 
of union contracts which impose non-linear constraints. 
"Provided that a person is assigned an activity of type 
A, he should be allowed an activity of type B within two 
weeks." This type of rule can be handled by quadratic " 
combinatorial constraints. Individual costs are set on 
each of the four possible scenarios: 

1. Mr. X is assigned none of the activities; 

2. Mr. X is assigned an activity from A; 

3. Mr. X is assigned an activity from B; 

4. Mr. X is assigned activities from both A and B. 
A typical union contract would recommend case 1 or 4. 
Case 2 and 3 would be illegal. This is not linear. 

These various constraints, as well as other 
appropriate :*or the industry, are used in solving the 
resource allocation problem as follows. 

The crew assignment problem will now be discussed 

with respect to the iterative method of the present 
invention in connection with Fig. 3, where the rows are 
the people in the crews and the columns are the 
different flight rotations and activities. Only a few 
people and activities cure shown for simplicity. Each 
entry in the table corresponds to a node in the network. 
Arcs are formed between the entries which reflect the 
constraints placed on the system. For example, due to 
airline regulations, at least 3 people must be assigned 
to each rotation. Therefore, arc Al constitutes the 
constraint Exj>3. With respect to the constraints on the 
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rows, clearly, each person can only be in one place at a 
time, therefore EX|<1. Other constraints are determined 
based, for example, on union rules which proscribe how 
many days in a row a crew member is allowed to work, how 
much training he/she must have, employee preferences, 
vacations, etc. , as discussed above. 

The table of Fig. 3 relates to the equations (1) , - 
(2) and (3) in the following manner. In the 
aforementioned equations, the "circs" of the table of 
Fig. 3 are indexed k. The index k in these equations 
refers, for example, either to a constraint requiring 
that a person is assigned exactly n activities (arc A2 
in Fig. 3) , or a similar constraint requiring that each 
activity is assigned exactly m persons (arc Al in Fig. 
15 3). To understand the meaning of r^xj , s^xj , p(xj , it 

is assumed that k corresponds to an arc requiring that a 
person should be assigned exactly ji activities. The 
index i then corresponds to the activities. p(xj is the 
probability that person k should have the activity i. 
20 It is this number that is actually listed in the table 

of Fig. 3. s^xj is the influence on this probability 
from this particular arc. That is, the requirement that 
the person k should have exactly n activities has a 
certain impact on p which is stored in s. r^xj is the 
25 input from p to arc k where the influence s^x^ from arc 

k to p is cancelled from p. 

An apparatus for implementing the method according 
to the present invention will now be described. In a 
preferred embodiment of the present invention, the 
apparatus is embodied as a computer program adapted to 
run on a programmed computer. It is understood that 
various hardware circuitry may be substituted for all or 
parts of the computer program without departing from the 
scope of the invention. 



30 



5 



SUBSTITUTE SHEET 



21 

Fig. 4A illustrates a flow-chart of the APC 200 
module which uses the probabilistic relaxation network 
method according to the present invention. The APC 200 
module takes single flight legs as input and produces 
rotations as output so as to minimize the total cost of 
the rotations. A rotation is a sequence of flight legs 
starting and ending -at the home base of the crew. 

Fig. 4B is a flow-chart illustrating the basic 
components of the APC 200 module. The Enumeration 
Process 210 takes flights as input and produces 
rotations as output. Each rotation is then priced by 
the Pricing Process 220. The Pricing Process 220 uses 
actual costs for salaries, hotel rooms, ground 
transportation, etc. to determine the accurate cost of 
each rotation. The rotations are then piped to a Filter 
Process 230 which sorts out illegal rotations. A 
feedback loop 232 from the Filter Process 230 to the 
Enumeration Process 210 is used to control the 
Enumeration Process 210 and increase the ratio of legal 
to illegal rotations flowing from the Enumeration 
Process 210 to the Filter Process 230. The specific 
operation of the Filter Process 230 is application 
dependent and depends directly on the legality rules 
defined for the application. The CRC 240 reads a file 
from a memory storage area (not shown) where the 
legality rules are stored for the particular application 
and assembles the Filter Process 230. 

The rotations which pass the Filter Process 230 are 
piped to the Set Covering Process 250 which uses the 
probabilistic relaxation network method according to the 
present invention. Here, each rotation corresponds to a 
"node" and each flight is associated with an arc in the 
probabilistic network. The arcs q require that each 
flight have at least one crew. There is also a penalty 
for assigning more than one crew. 
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Since the enumeration process can not be complete 
(it would result in too many rotations) , the resulting 
rotations from the Set Covering Process 250 are examined 
by a Subproblem Selection Process 260 which selects a 
5 subset of rotations which can be improved. These are 

passed back to the Enumeration Process 210 which is 
complete for smaller subproblems. The enumeration 
process is complete when all possible rotations have 
been enumerated. When no improvement is encountered in 

10 several consecutive subproblem runs, the program 

terminates. The number of consecutive runs in which the 
stability occurs is application dependent as described 
above with respect to network convergence. At the 
termination of the program , the values of p(x { ) of the 

15 last iteration constitute the optimal or near optimal 

set of rotations based on the constraints of the 
industry . 

The resulting rotations may then be matched with 
the airline personel in a second program module shown in 

20 Fig. 5. This second program module also uses the 

probabilistic relaxation network method according to the 
present invention. This module inputs resources such as 
persons and activities (that is, rotations) and outputs 
assignments according to which persons are assigned to 

25 tasks. The resources are passed to a Node Generator 310 

which generates a node for each possible assignment. 
Nodes corresponding to assignments which are illegal are 
not generated. The nodes are passed to a Pricing 
Process 315 which takes personal preferences and costs 

30 as input and performs the determination of the 



Legality rules cure passed to the CRC Process 330 
which produces code for determining if a combination of 
assignments is allowed or not. This code, together with 
35 the nodes are passed to the Translation Process 340 
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which uses the various techniques to translate the 
legality rules to computable q- functions. The 
techniques will be described in detail later. The Arc 
Generator 350 then uses this information to feed the 
5 probabilistic network with arcs. As input, the Arc 

Generator 350 also takes information about resource 
limitations and some strategic information such as 
penalties on unfair schedules. This information results 
in more capacity and configuration arcs. The nodes and 

10 the arcs together constitute the probabilistic network 

which is passed to the Optimization Process 360. The 
Optimization Process 360 performs the steps (l)-(3) 
described above to obtain an optimal or near optimal 
assignment of personel to tasks. The optimal or near 

15 optimal assignments, determined from the values of p(xj 

of the last iteration, constitute an allocation of 
resources and facilities which minimizes the costs of 
such allocation. 

As shown in Fig. 6, certain legality constraints 

20 are input into Preprocessor 100, which then outputs 

capacity constraints and variables. These capacity 
constraints and variable are then input into the R 
Process 120. The Preprocessor 100 of Fig. 6 corresponds 
to the elements of Fig. 4 A that preceed the Set Covering 

25 Process 250 and the elements of Fig. 5 that preceed the 

Optimization Process 360. Correspondingly, the Set 
Covering Process 250 and the Optimization Process 360 

call a routine which implements the R Process 120 

* 

through the Stopping Criteria Process 150 of Fig. 6. 

30 The legality constraints consist of, for example, 

restrictions on work time etc. as described in union, 
government and company agreements as described above. 
These can be extremely complex from a mathematical point 
of view and must be transformed to a form that can be 

35 efficiently computed. "Efficiently computed" refers to 
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the situation where the q-f unction corresponding to the 
constraint should be possible to represent in such a way 
that step (2) of the three-step procedure discussed 
above can be performed in linear time. 
5 The capacity constraints are constraints limiting 

the capacity of a certain resource. For instance, the 
resource rotation 1 from Fig. 3 is limited by constraint 
(arc) Al to take no less than 3 persons. Mathematically 
this implicates that the sum of four binary variables is 

10 equal to or greater than three. A capacity constraint 

is thus a special case of a q-f unction that can be 
efficiently computed. Another type of q-function used 
by the apparatus of the present invention is the 
"configuration arc". In this arc, single configurations 

15 of variables can be associated with a cost. 

Depending on the specific application of the method 
and apparatus of the present invention, one or more of 
the following techniques cure used to translate legality 
constraints into "computable q-f unctions" . 

20 a new variable can be generated for each 

configuration of variables that is "legal" according to 
a set of legality constraints. This approach is 
appropriate when the set of legality constraints can be 
partitioned into subsets where each subset is connected 

25 to a small subset of the variables, since all 

configurations of the variable subsets can be listed. 
Each set of legality constraints in the old variables 

* 

will then give rise to one linear equality constraint in 
the new variables . 

30 Alternatively, complex legality constraints over 

many variables can be translated to a set of linear 
constraints over smaller groups of variables . For 
instance, in the assignment problem, all pairs of 
activities can be tested for legality. If an illegal 

35 pair is encountered, a constraint is added requiring 
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that only one of the two activities should be assigned 
to the same person. 

The third method takes into account that the 
machine can handle general nonlinear constraints. If 
step (2) of the iteration procedure can be efficiently 
computed, the legality constraint can be implemented 
directly as a q-f unction, with no translation. ■* 

The probabilistic relaxation network technique of 
the present invention is shown in the flowchart of Fig. 
7. Step 10 selects the initial values of the variables 
p(Xg) , that is, the entries in the table of Fig. 3 and 
sets up the constraints represented by the q-f unctions, 
that is, the relationships between the various entries 
in the table. This step is accomplished in the 
Preprocessor 100 shown in Fig. 6. 

Step 20 in the flowchart of Fig. 7, carried out in 
the R Process 120 of Fig. 6, calculates the input r^xj 
to the arc by dividing away the old output from the 
current value of p(xj . This step is carried out in 
parallel for all i. Step 30 in the flowchart of Fig. 7, 
carried out in the S Process 130 of Fig. 6, does the 
actual updating of the influences for each contraint. 
This step is also carried out in parallel for all i. 
Step 40 in the flowchart of Fig. 7, carried out in the P 
Process 140 of Fig. 6, calculates the new values of 
p(Xt) . This step is carried out in parallel for all i. 
A loop is performed from step 20 through step 40 for all 
k in parallel. Thus steps 20, 30 and 40 are performed 
for all k. 

Step 50 of the flowchart shown in Fig. 79 
determines whether the stopping criteria is satisfied. 
This is accomplished in the Stopping Processor 150 of 
Fig. 6. If the stopping criteria is satisf ied, a 
solution has been reached and the process ends, having 
determined where each crew member is to be assigned on a 
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given day. If the stopping criteria has not been 
satisfied, steps 20-50 are repeated. 

In the Stopping Processor 150, it is determined 
when no change in the solution is encountered during 
5 several consecutive iterations. When a static condition 

is found, the program stops. 

While the present inventors have constructed a new 
method for solving probabilistic relaxation problems, it 
is to be understood that the claims of this invention 
10 relate only to the application of this novel method to 

arrangements that determine the optimum allocation of 
resources in real world technological and industrial 
systems that lend themselves to a probabilistic 
representation of the variables and constraints 
15 characterizing the system, i.e., physical arrangements 

that determine how resources are actually applied to 
optimize the performance of processes, machines, 
manufactures or compositions of matter. All other uses 
of the new method, such as computation research, 
20 algorithm research, or probabilistic research 

activities, form no part of the present invention. 
Similarly, use of the new method in non-technological or 
non-industrial systems likewise form* no part of the 
present invention. 
25 The foregoing description of the specific 

embodiments will so fully reveal the general nature of 
the invention that others can, by applying current 
knowledge, readily modify and/ or adapt for various 
applications such specific embodiments without departing 
from the scope of the invention, and, therefore, such 
adaptations and modifications should and are intended to 
be comprehended within the meaning and range of 
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herein is for the purpose of description and not of 
limitation. 



SUBSTITUTE SHEET 



WO 92/03905 



PCT/SE91/00585 



28 

CLAIMS: 

1. A method of optimally assigning available 
resources and available facilities comprising the steps 
of: 

creating a network of nodes and arcs, said arcs 
connecting various ones of said nodes, where the nodes 
correspond to assignments of a resource and a facility 
and the arcs correspond to constraints on the 
assignments; 

iteratively performing the following steps until a 
stopping criteria is reached representing that the 
network is in a stable state: 

calculating inputs to the arcs r^fxj disregarding 
the old outputs from the arcs s^Xf) ; 

calculating the influences on the nodes s^fxj based 
on the inputs to the arcs r^xj ; and 

calculating the new values for the nodes p(x t ) based 
on the influences on the nodes s & (x t ) ; and 

allocating said resources and facilities in 
accordance with the stable state reached by the network. 

2. The method as set forth in claim 1, wherein the 
step of iteratively performing is performed for all k in 
parallel. 



3. The method as set forth in claim 2, wherein the 
inputs to the arcs are calculated in accordance with the 
formula, for all i,XfiX k in parallel: 



1 ' 
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4. The method as set forth in claim 3, wherein the 
influences on the nodes are calculated in accordance 
with the formula, for all i f x { eX k in parallel: 

5. The method as set forth in claim 4, wherein the 
new values for the nodes are calculated in accordance 
with the formula, for all i,x,eX k in parallel: 

6. A method of allocating available personnel to 
a plurality of tasks so as to effectively accomplish the 
tasks, the method comprising the steps of: 

determining a plurality of tasks to be filled by 
the available personnel; 

assigning costs for each of said tasks; 

determining a plurality of variables and influences 
on said variables in the form of capacity constraints, 
from said plurality of tasks and costs; 

iteratively updating the influences on said 
variables using a probabilistic relaxation network 
technique so as to minimize the total of the costs of 
said tasks; 

terminating said iterative updating steps when the 
influences remain stable for a predetermined number of 
iterations; and 

allocating said personnel to said tasks in 
accordance with the updated influences. 

7. An apparatus for allocating available 
personnel to a plurality of tasks so as to reduce the 
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cost of accomplishing the tasks, the apparatus 
comprising: 

preprocessor means for inputting a plurality of 
legality constraints and outputting a plurality of 
5 capacity constraints and variables which are influenced 

by the capacity constraints; 

first processor means for calculating inputs for 
each of the variables to the capacity constraints; 

second processor means for updating the influences 
10 for each capacity constraint; 

third processor means for calculating updated 
values for the variables; and 

stopping processor means for determining when a 
predetermined stopping criteria has been satisfied, 
15 wherein when the predetermined stopping criteria is 

satisfied, the available personel are assigned to the 
tasks in accordance the updated values of the variables. 

8. An apparatus in accordance with claim 7, 
wherein said preprocessor means comprises: 

a node generator for generating a plurality of 
nodes corresponding to each possible assignment; 

a translator for translating a plurality of 
legality rule into computable q-f unctions; and 

an arc generator for generator a plurality of arcs 
forming constraints on the assignments, wherein said 
variables correspond to said nodes and said constraints 
correspond to said arcs and wherein said arcs and nodes 
are passed to said first, second, third, and stopping 
processor means, successively. 
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